A Performance Assessment of Model Selection Criteria When the Number of Objects Is Much Larger than the Number of Variables in PLSR

نویسنده

  • Elif Bulut
چکیده

Partial Least Squares Regression (PLSR) is a method for constructing predictive models when the variables are many and highly collinear. Its goal is to predict a set of response variables from a set of predictor variables. This prediction is achieved by extracting a set of orthogonal factors called latent variables from the predictor variables. This study investigated the performances of model selection criteria in selecting the true number of latent variables from PLSR models for data sets that have various observations and variable numbers. Their performances have been compared in terms of the simulation study and 5-fold cross validation. This simulation has been performed for different numbers of predictor variables and different numbers of observation units to compare the performance of two types of Multivariate Akaike Information criterion and three types of Wold’s R criterion in finding the number of true latent variables. The simulation results show that all criteria achieved the true number of latent variables for a small-sized design matrix. It was noticed that when the observation numbers were increased, PLSR worked with a larger number of latent variables, except for some cases. Wold’S R_2 and Wold’S R_3 found less numbers as the number of latent variables.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PROVIDING A MODEL FOR THE SUPPLIER SELECTION PROCESS IN THE SUPPLY CHAIN MANAGEMENT WITH HYBRID MODEL OF DECISION MAKING

<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...

متن کامل

Feature Selection for Small Sample Sets with High Dimensional Data Using Heuristic Hybrid Approach

Feature selection can significantly be decisive when analyzing high dimensional data, especially with a small number of samples. Feature extraction methods do not have decent performance in these conditions. With small sample sets and high dimensional data, exploring a large search space and learning from insufficient samples becomes extremely hard. As a result, neural networks and clustering a...

متن کامل

PROVIDING A MODEL FOR THE SUPPLIER SELECTION PROCESS IN THE SUPPLY CHAIN MANAGEMENT WITH HYBRID MODEL OF DECISION MAKING

<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...

متن کامل

Investigation of unbalanced magnetic force in permanent magnet brushless dc machines with diametrically asymmetric winding

The purpose of this paper is the calculation of Unbalanced Magnetic Force (UMF) in permanent magnet brushless DC (PMBLDC) machines with diametrically asymmetric winding and investigation of UMF variations in the presence of phase advance angle. This paper presents an analytical model of UMF in surface mounted PMBLDC machines that have a fractional ratio of slot number to pole number. This model...

متن کامل

ارزیابی کارایی نسبی بیمارستان‌های دولتی استان یزد با استفاده از مدل تحلیل پوششی داده‌ها

Introduction: Performance measurement, programming and goal setting for performance improvement are needed for organizations to improve their performance. Despite advancement in performance measurement systems, many organizations still emphasize on old models. Methods: This paper analyzes the efficiency of Yazd governmental hospitals by using DEA model from 2004 to 2006. The inputs in DEA...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013